Ultravox V0 6 Llama 3 3 70b
MIT
Ultravox is a large multimodal speech language model that combines a pre-trained large language model and a speech encoder, capable of handling both speech and text inputs.
Text-to-Audio
Transformers Supports Multiple Languages